skip to main content


Search for: All records

Creators/Authors contains: "Yu, Han"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Background

    As mobile health (mHealth) studies become increasingly productive owing to the advancements in wearable and mobile sensor technology, our ability to monitor and model human behavior will be constrained by participant receptivity. Many health constructs are dependent on subjective responses, and without such responses, researchers are left with little to no ground truth to accompany our ever-growing biobehavioral data. This issue can significantly impact the quality of a study, particularly for populations known to exhibit lower compliance rates. To address this challenge, researchers have proposed innovative approaches that use machine learning (ML) and sensor data to modify the timing and delivery of surveys. However, an overarching concern is the potential introduction of biases or unintended influences on participants’ responses when implementing new survey delivery methods.

    Objective

    This study aims to demonstrate the potential impact of an ML-based ecological momentary assessment (EMA) delivery system (using receptivity as the predictor variable) on the participants’ reported emotional state. We examine the factors that affect participants’ receptivity to EMAs in a 10-day wearable and EMA–based emotional state–sensing mHealth study. We study the physiological relationships indicative of receptivity and affect while also analyzing the interaction between the 2 constructs.

    Methods

    We collected data from 45 healthy participants wearing 2 devices measuring electrodermal activity, accelerometer, electrocardiography, and skin temperature while answering 10 EMAs daily, containing questions about perceived mood. Owing to the nature of our constructs, we can only obtain ground truth measures for both affect and receptivity during responses. Therefore, we used unsupervised and supervised ML methods to infer affect when a participant did not respond. Our unsupervised method used k-means clustering to determine the relationship between physiology and receptivity and then inferred the emotional state during nonresponses. For the supervised learning method, we primarily used random forest and neural networks to predict the affect of unlabeled data points as well as receptivity.

    Results

    Our findings showed that using a receptivity model to trigger EMAs decreased the reported negative affect by >3 points or 0.29 SDs in our self-reported affect measure, scored between 13 and 91. The findings also showed a bimodal distribution of our predicted affect during nonresponses. This indicates that this system initiates EMAs more commonly during states of higher positive emotions.

    Conclusions

    Our results showed a clear relationship between affect and receptivity. This relationship can affect the efficacy of an mHealth study, particularly those that use an ML algorithm to trigger EMAs. Therefore, we propose that future work should focus on a smart trigger that promotes EMA receptivity without influencing affect during sampled time points.

     
    more » « less
    Free, publicly-accessible full text available January 1, 2025
  2. Shift work disrupts sleep and causes chronic stress, resulting in burnout syndrome characterized by emotional exhaustion, depersonalization, and decreased personal accomplishment. Continuous biometric data collected through wearable devices contributes to mental health research. However, direct prediction of burnout risk is still limited, and interpreting machine learning (ML) models in healthcare poses challenges. In this paper, we develop machine learning models that utilize wearable and survey data, including rhythm features, to predict burnout risk among shift workers. Additionally, we employ the DiCE (Diverse Counterfactual Explanations) framework to generate interpretable explanations for the ML model, aiding in the management of burnout risks. Our experiments on the AMED dataset show that incorporating rhythm features significantly enhances the predictive performance of our models. Specifically, sleep and heart rate features have emerged as significant indicators for accurately predicting burnout risk 
    more » « less
    Free, publicly-accessible full text available November 15, 2024
  3. Physiological and behavioral data collected from wearable or mobile sensors have been used to estimate self-reported stress levels. Since stress annotation usually relies on self-reports during the study, a limited amount of labeled data can be an obstacle to developing accurate and generalized stress-predicting models. On the other hand, the sensors can continuously capture signals without annotations. This work investigates leveraging unlabeled wearable sensor data for stress detection in the wild. We propose a two-stage semi-supervised learning framework that leverages wearable sensor data to help with stress detection. The proposed structure consists of an auto-encoder pre-training method for learning information from unlabeled data and the consistency regularization approach to enhance the robustness of the model. Besides, we propose a novel active sampling method for selecting unlabeled samples to avoid introducing redundant information to the model. We validate these methods using two datasets with physiological signals and stress labels collected in the wild, as well as four human activity recognition (HAR) datasets to evaluate the generality of the proposed method. Our approach demonstrated competitive results for stress detection, improving stress classification performance by approximately 7% to 10% on the stress detection datasets compared to the baseline supervised learning models. Furthermore, the ablation study we conducted for the HAR tasks supported the effectiveness of our methods. Our approach showed comparable performance to state-of-the-art semi-supervised learning methods for both stress detection and HAR tasks. 
    more » « less
    Free, publicly-accessible full text available June 12, 2024
  4. Free, publicly-accessible full text available September 10, 2024
  5. Abstract

    The noniterative conditional expectation (NICE) parametric g-formula can be used to estimate the causal effect of sustained treatment strategies. In addition to identifiability conditions, the validity of the NICE parametric g-formula generally requires the correct specification of models for time-varying outcomes, treatments, and confounders at each follow-up time point. An informal approach for evaluating model specification is to compare the observed distributions of the outcome, treatments, and confounders with their parametric g-formula estimates under the “natural course.” In the presence of loss to follow-up, however, the observed and natural-course risks can differ even if the identifiability conditions of the parametric g-formula hold and there is no model misspecification. Here, we describe 2 approaches for evaluating model specification when using the parametric g-formula in the presence of censoring: 1) comparing factual risks estimated by the g-formula with nonparametric Kaplan-Meier estimates and 2) comparing natural-course risks estimated by inverse probability weighting with those estimated by the g-formula. We also describe how to correctly compute natural-course estimates of time-varying covariate means when using a computationally efficient g-formula algorithm. We evaluate the proposed methods via simulation and implement them to estimate the effects of dietary interventions in 2 cohort studies.

     
    more » « less
  6. Abstract

    Microbially-synthesized protein-based materials are attractive replacements for petroleum-derived synthetic polymers. However, the high molecular weight, high repetitiveness, and highly-biased amino acid composition of high-performance protein-based materials have restricted their production and widespread use. Here we present a general strategy for enhancing both strength and toughness of low-molecular-weight protein-based materials by fusing intrinsically-disordered mussel foot protein fragments to their termini, thereby promoting end-to-end protein-protein interactions. We demonstrate that fibers of a ~60 kDa bi-terminally fused amyloid-silk protein exhibit ultimate tensile strength up to 481 ± 31 MPa and toughness of 179 ± 39 MJ*m−3, while achieving a high titer of 8.0 ± 0.70 g/L by bioreactor production. We show that bi-terminal fusion of Mfp5 fragments significantly enhances the alignment of β-nanocrystals, and intermolecular interactions are promoted by cation-π and π-π interactions between terminal fragments. Our approach highlights the advantage of self-interacting intrinsically-disordered proteins in enhancing material mechanical properties and can be applied to a wide range of protein-based materials.

     
    more » « less
  7. Background Shift workers are at high risk of developing sleep disorders such as shift worker sleep disorder or chronic insomnia. Cognitive behavioral therapy (CBT) is the first-line treatment for insomnia, and emerging evidence shows that internet-based CBT is highly effective with additional features such as continuous tracking and personalization. However, there are limited studies on internet-based CBT for shift workers with sleep disorders. Objective This study aimed to evaluate the impact of a 4-week, physician-assisted, internet-delivered CBT program incorporating machine learning–based well-being prediction on the sleep duration of shift workers at high risk of sleep disorders. We evaluated these outcomes using an internet-delivered CBT app and fitness trackers in the intensive care unit. Methods A convenience sample of 61 shift workers (mean age 32.9, SD 8.3 years) from the intensive care unit or emergency department participated in the study. Eligible participants were on a 3-shift schedule and had a Pittsburgh Sleep Quality Index score ≥5. The study comprised a 1-week baseline period, followed by a 4-week intervention period. Before the study, the participants completed questionnaires regarding the subjective evaluation of sleep, burnout syndrome, and mental health. Participants were asked to wear a commercial fitness tracker to track their daily activities, heart rate, and sleep for 5 weeks. The internet-delivered CBT program included well-being prediction, activity and sleep chart, and sleep advice. A job-based multitask and multilabel convolutional neural network–based model was used for well-being prediction. Participant-specific sleep advice was provided by sleep physicians based on daily surveys and fitness tracker data. The primary end point of this study was sleep duration. For continuous measurements (sleep duration, steps, etc), the mean baseline and week-4 intervention data were compared. The 2-tailed paired t test or Wilcoxon signed rank test was performed depending on the distribution of the data. Results In the fourth week of intervention, the mean daily sleep duration for 7 days (6.06, SD 1.30 hours) showed a statistically significant increase compared with the baseline (5.54, SD 1.36 hours; P=.02). Subjective sleep quality, as measured by the Pittsburgh Sleep Quality Index, also showed statistically significant improvement from baseline (9.10) to after the intervention (7.84; P=.001). However, no significant improvement was found in the subjective well-being scores (all P>.05). Feature importance analysis for all 45 variables in the prediction model showed that sleep duration had the highest importance. Conclusions The physician-assisted internet-delivered CBT program targeting shift workers with a high risk of sleep disorders showed a statistically significant increase in sleep duration as measured by wearable sensors along with subjective sleep quality. This study shows that sleep improvement programs using an app and wearable sensors are feasible and may play an important role in preventing shift work–related sleep disorders. International Registered Report Identifier (IRRID) RR2-10.2196/24799. 
    more » « less